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(54) Tltie: METHOD AND APPARATUS FOR REDUCING POOL STARVATION IN A SHARED MEMORY SWITCH 




(57) Abstract: A switch includes a reserved pool of buffers in 
a shared memory The reserved pool of buffers is reserved for 
exclusive use by an egress port. The switch includes pool select logic 
which selects a free buffer from the reserved pool for storing data 
received from an ingress port to be forwarded to the egress port The 
shared memory also includes a shared pool of buffers. The shared 
pool of buffers is shared by a plurality of egress ports The pool 
select logic selects a free buffer in the shared pool upon detecting 
no free buffer in the reserved pool . The shared memory may also 
include a multicast pool of buffers Ihe multicast pool of buffers is 
shared by a plurality of egress ports The pool select logic selects a 
free buffer in the multicast pool upon detecting an IP Multicast data 
packet received from an ingress port 
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METHOD AND APPARATUS FOR REDUCING POOL STARVATION 
IN A SHARED MEMORY SWITCH 

BACKGROUND OF THE INVENTION 

A networking switch receives data packets at ingress ports and foiwards the 
5 data packets to one or more egress poits. The switch determines the egress port or 
ports to which the data packets are foiwarded dependent on a destination addr ess 
included in the data packet . A data packet received from an ingress port is stored in 
memory in the switch before being foiwarded, 

The memoiy in the switch may be a shared memory in which all r eceived 
1 0 data packets are stored befor e being foiwarded As the received data is stored in the 
shared memory, a data packet pointer ; that is, a pointer to the location of the 
received data in shared memoiy is stored in one or more queues , The data packet 
pointer' may be queued dependent on the ingress port at which it was received or the 
egress pott at which it is to be transmitted, An output buffered switch queues a data 
1 5 packet pointer dependent on the egress port at which it is to be tr ansmitted.. 

The shared memory is shar ed by all the egress poits and a location is 
randomly allocated to an egr ess port for storing a r eceived data packet to be 
foiwar ded to the egr ess port. The number of data packets transmitted thr ough each 
egress port can vary dramatically resulting in temporary congestion in a particular 
20 egress port, Temporary congestion may occur- at an egress port if data packets 
received from a plurality of ingress poits are to be forwarded through Are same 
egress port or if there is a speed mismatch between the ingress port and the egress 
port, for example, the ingress poit receives data at 1 Gigabits per' second and the 
egress port transmits data at 100 Megabits per second, 
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SUMMARY OF THE INVENTION 

A shared memoiy provides more memory for use by a congested egress port 
but all the shared memoiy may be consumed storing data for a congested egress port 
leaving no memoiy available for stoiing data to be forwarded to non-congested 
5 egr ess ports This results in data packets being chopped for non-congested egress 
ports, 

In accordance with the invention, a shared memoiy switch including a 
reserved pool of 'buffers in the shared memoiy is presented The reserved pool of 
buffers is reserved foi an egress port. The switch includes pool select logic which 
10 selects a free buffer from the reserved pool for storing data received from an ingress 
port to be forwarded to the egress port. 

The shar ed memoiy also includes a shar ed pool of buffer s„ The shared pool 
of buffers is shared by a plurality of egress ports Upon detecting no free buffer in 
the reserved pool, the pool select logic selects a free buffer in the shared pool, 
15 The shared memoiy may also include a multicast pool of buffers The 

multicast pool of buffers is shared by a plurality of egress ports , The pool select 
logic selects a free buffer in the multicast pool upon detecting an IP Multicast data 
packet received from an ingress port . 

BRIEF DESCRIPTION OF THE DRAWINGS 

20 The for egoing and other objects, featur es and advantages of the invention 

will be apparent fr om the following mor e particular description of preferred 
embodiments of the invention, as illustr ated in the accompanying dr awings in which 
like reference characters refer to the same parts throughout the different views,. The 
drawings are not necessarily to scale, emphasis instead being placed upon illustrating 

25 the principles of the invention,, 

Fig . 1 is a block diagr am of a shared memory switch; 

Fig., 2 is a block diagram of 'the shared memory shown in Fig. 1 illustrating the 
reserved pools, shared pool and multicast pool; 

Fig., 3 is a block diagram of the pool select logic in the packet storage manager 
30 shown in Fig.. 1; 
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Fig. 4 is a block diagram of the pool counters shown in Pig 3; 

Fig 5 is a flowchart illustrating the steps for selecting a pool implemented in pool 

counter logic shown in Fig . 3; 

Fig . 6 is a flowchart illustrating the steps for retaining a buffer to a pool 
5 implemented in pool counter logic shown in Jig , 3 . 

DETAILED DESCRIPTION OF THE INVENTION 

A description of preferred embodiments of the invention follows. 
Fig, 1 is a block diagram of a shared memory switch 100, All data received 
on ingress ports 102 is stor ed in shared memory 108 before being forwarded to one 
10 or more egress ports 1 12 0 _ n . The packet storage manager' 106 controls write and read 
access to the shar ed memory 108., The packet storage manager 106 includes pool 
select logic 124 . The pool select logic 124 logically portions the shar ed memory 108 
into a shared pool, a multicast pool and a plurality of reserved pools as illustrated in 
Fig 2 . Ther e is no actual physical portioning of the shar ed memory 1 08 
1 5 The pool select logic 124 manages the buffers in each pool by selecting a 

pool from which to allocate a buffer'. A buffer' is allocated from the egress port's 
reserved pool if the number of allocated buffers for' an egress port 1 12 is less than a 
reserved threshold count. The pool select logic 124 is described later in conjunction 
with Fig. 2 Read and write access to the shar ed memory 108 is described in co- 
20 pending U. S . Patent Application Serial Number 09/386,589 filed on August 31, 
1999 entitled "Method and Appar atus for an Interleaved Non-Blocking Packet 
Buffer" by David A, Brown which is incorporated herein by reference in its entirety 

The switch 100 includes an ingress ports engine 104 and an egress ports 
engine 1 10, A data packet is received serially at an ingress port 102 The ingress 
25 engine 104 detects and processes headers in the r eceived data packet, determines 
from the detected header on which egr ess port 1 12^ to forward the data packet and 
gener ates a forward vector 1 1 14 for' the data packet, The forward vector- 1 14 is a bit ■ 
map, with a bit corresponding to each of the plur ality of egr ess ports 1 12 0 _ rt , 
indicating whether the data packet is to be forwarded to that egress port 112^.. A 
30 method and apparatus for generating a forward vector 1 1 14 is described in co-pending 
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U..S . Patent Application Seiial Number Serial Number 09/453,344 filed December 1, 
1999 entitled '"Method and Apparatus for Wire-Speed IP Multicast Forwarding" by 
David A., Brown, the entir e teachings of which are incorporated herein by refer ence 
in its entirety. The forward vector 1 14 is forwarded to the packet storage manager' 
5 106. 

The packet storage manager 106 provides shared memory addresses 122 for 
read and write operations to the shar ed memory 108 and stor es the locations in the 
shar ed memory 1 08 of each data packet stored in one or more egress port queues 
(not shown) , A method and appar atus for providing egress port queues is described 
10 in co-pending US Patent Application Serial Number' 09/559,190 filed on April 27, 
2000 entitled "Port Packet Queuing" by Richard M„ Wyatt incorporated her ein by 
reference in its entirety., The egr ess engine 110 through select 120 selects one of the 
plur ality of egress ports 1 12 0 .. n5 on which to tr ansmit a data packet and provides tire 
stored data packet to the selected egress port 1 12 0 _ a , 
1 5 The shared memory 1 08 stores data packets received from any of the ingress 

ports 102 for transmitting to any of 1 the egress ports 1 12 0 _ n „ The switch 1 00 provides 
concurrent processing by the ingr ess port engine 104 of data packets received at 
ingr ess ports 102 and pr ocessing of stor ed data packets by the egress port engine 110 
for' forwarding on egress ports 1 12 0 , n , 

20 As a data packet is received serially on one of the ingr ess ports 102, the serial 

data may be grouped into data segments , Each data segment is stor ed in a buffer 1 in 
shared memory 108, Thus, the size of a buffer is dependent on the size of a data 
segment . The size of a data segment is dependent on the networking protocol 
supported by the switch 100,. For- example, for 1 the Ethernet networking protocol, the 

25 size selected for 1 the data segment is 64 bytes because the minimum data packet size 
for' an Ethernet data packet is 64 bytes , 

The ingress ports engine 104 also determines from the header" included in a 
received data packet if a data packet is an IP Multicast data packet . The switch 100 
forwards a copy of an IP Multicast data packet to each member- of the IP Multicast 

.30 group., A single copy of the data packet is stored in shared memory 1 08 with a 

Multicast Forwarding Index ("MFF') as is described in co-pending US - co-pending 
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U.S., Patent Application Seiial Number Serial Number 09/453,344 filed December 1, 
1999 entitled 6< Method and Apparatus for Wire-Speed IP Multicast Forwarding" by 
David A, Brown, which is incorporated herein by reference in its entirety Members 
of the IP Multicast group may be connected to the same egress poit 112 or to more 
5 than one egress port 1 12,. Multiple copies of the IP Multicast data packet are 
foi warded to an egress poit 112, one for each member of the IP Multicast gr oup 
connected to the egress port 1 12 If the received data packet is an IP Multicast data 
packet, the ingress ports engine 104 forwards a multicast tag 126 with the forward 
vector 114 indicating that the received data packet is an IP Multicast data packet., 
10 The multicast tag 126 indicates whether- the received data packet is an IP Multicast 
data packet . The multicast tag 126 can be implemented as a single bit set to T if the 
received data packet is an IP Multicast data packet 

Fig,. 2 is a block diagram of the shared memory 108 shown in Fig 1 logically 
portioned into a shared pool 200, a Multicast pool 204 and a plurality of reserved 
15 pools 2O2 0 - 202 n „ A reserved pool 202 0 - 202^ is provided for each egress port 

1 12 0 „ n , A reserved threshold count of buffers is assigned to each reserved pool 202 0 - 
202 n .. The reserved threshold count of buffers may only be allocated for storing data 
packets for the r espective egress port 112, The shar ed pool 200 is provided for 
storing data packets to be forwarded to any of the egress ports 1 12^., The Multicast 
20 pool 204 is provided for storing IP Multicast data packets to be forwarded to any of 
the egress ports 112 0 . n , 

A buffer is allocated from the egress ports reserved pool 202^ while the 
count of allocated buffers is below the threshold count., If the number of reserved 
allocatable buffers has been exceeded, buffers are allocated from the shared pool 
25 200, A buffer for a data packet can only be allocated from the r espective reserved 
pool 202(j - 202 n associated with the egress port 112^. Thus, even if all the buffers 
in the shared pool 200 ar e allocated to one congested egress port 1 12^, a buffer' is 
available for allocation to a non-congested port from the non-congested port's 
respective reserve pool 202 0 - 202 n< , 
30 If the data packet is an IP Multicast data packet, the data packet is stor ed 

once in shared memory 108 and a copy of the stor ed data packet is forwar ded to 
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members of the IP Multicast group,, The IP Multicast data packet is stored in shared 
memoiy 108 until the IP Multicast data packet has been forwarded to each member 
of the IP Multicast group., 

As the stored IP Multicast data packet may be fot warded to more than one 
5 egress port 1 12, it is difficult to select a port reserved pool 202 from which to 
allocate a buffer in which to store the IP Multicast data packet . Also, without 
keeping separate accounting for IP Multicast data packets, it cannot be determined 
when an IP Multicast data packet is being dequeued whether' it is from a reserved 
pool 202 or- a shared pool 200 because the buffer may not have been allocated from 
10 the r eserved pool for the egress port to which the last IP Multicast data packet is 
forwarded . Having a separate pool for storing IP Multicast data packets allows a 
switch 100 to limit the number of buffer resources that can be consumed for' stoiing 
IP Multicast data packets.. Thus, a Multicast pool 204 is provided to stor e IP 
Multicast data packets , 
15 fig. 3 is a block diagram of the pool select logic 124 in the packet storage 

manager 106 shown in Fig., 1., The pool select logic 124 includes pool counters 300 
and pool counter logic 302 , The pool counters 300 include a counter for each pool 
200, 202 0 -202 n , 204 in shared memoiy 108 . The counter 1 status 304 is forwarded to 
the pool counter' logic 302.. 
20 The pool counter logic 302 selects the pool 200, 202 0 -202 n or 204 from 

which to allocate a buffer dependent on the state of counter status 304, the forward 
vector 114 and the Multicast tag 126 for' the received data packet to be stored in 
shared memory 108, After selecting the pool 200, 202 0 -202 n or 204, the pool 
counter' logic 302 generates counter increment-decrement 306 to increment or 
25 decrement the respective counter in pool counters 300, The pool counter logic 302 
also generates buffer_available 308,. The state of buffer_available 308 indicates 
whether a buffer 1 is available for storing a received non-Multicast data packet in 
shar ed memory 108., If no buffer is available for' a non- Multicast data packet in the 
shared pool 200 or the port reserved pool 202, or for a Multicast data packet in the 
30 Multicast pool 204, the received data packet is dropped. 
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If a buff er is available in which to store the received data to be forwarded to 
an egress port, a buffer is allocated from a free list of buffers in shared memoiy 108 
as described in conjunction with co-pending U.S , Patent Application Seiial Number 
09/386,589 filed on August 31, 1999 entitled "Method and Apparatus for an 
5 Interleaved Non-Blocking Packet Buffer" by David A Brown which is incorporated 
herein by refer ence in its entir ety . 

Fig. 4 is a block diagram of the pool counters 300 shown in Fig. 3 . The pool 
counter s 300 include a shared free counter 400, a total free counter 402, a Multicast 
free counter 41 6, a reset ved free counter 428, a queue reserved thr eshold register 
10 412, and a port reserve pool counter 414 0 _ n for each egress port 1 12 0 „ n in the switch 
100, 

The total free counter 402 stores the total number of free buffers in shared 
memory 108., The total free counter 402 is preset with the sum of the buffer s in the 
shared pool 200, the port reserve pools 2.G2 (r 202 n and the Multicast pool 204 . The 

15 total free counter 402 is decremented each time a buffer is allocated from any pool to 
store a data packet., If 'the pools are assigned as follows: reserved « 500; shared = 
2000; Multicast = 500, the sum of the buffers is 3000 equal to the total number of 
available buffets IP Multicast data packets are dropped once the 500 data packets in 
the Multicast pool 204 ar e used even if there ar e buffers available in the shared pool 

20 200,. 

Ther e is a certain amount of "tuning" requir ed in a switch . The taring 
includes determining how to divide the shared memory 108 to reduce the probability 
of dr opping data packets during congestion. Most congestion occur s when there ar e 
bursts of data colliding together at an output port , Depending on factors such as 

25 burst length and others, the dr op pr obability varies dependent on the over' 

subscription ratios of the pools in shared memory 108. Thus, the thresholds for the 
pools in shared memory 108 are modifiable dependent on expected load conditions., 

In an alternative embodiment, the pools 200, 202 0 -202 n and 204 are 
over subscribed. The total free counter 402 is initialized with the total number 1 of 

30 physical buffers in shared memory 1 08 , The sum of 'the buffer's in the shared pool 
200, the port r eserve pools 202^200^ and the Multicast pool 204 is greater- than the 
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count stored in Hie total free counter 402 . This allows for better utilization of 
buffers in shared memory 108, but reduces the guarantee that there will be a buffer 
available in the poit reserve pool 202 

For example, if the pools ate assigned as follows: reserved pool ^1000; 
5 shared pool = 2000; Multicast pool = 1 000, the sum of the buffers is 4000 but the 
total number- of available buffers in shared memory 108 is 3000; that is, less than the 
sum of the buffers in the pools. This allows better utilization of the buffers in shared 
memory 108 if there is more Multicast data tr affic than unicast and broadcast data 
traffic through the switch 100 because one third of the total number of available 

1 0 buffers may be allocated for storing IP Multicast data packets . 

If the Multicast pool 204 is not oversubscribed, Multicast data packets ar e 
dropped when the Multicast pool 204 is full even if there ar e buffers available in 
shared memory 108 Thus, with over subscription, it is less likely that data packets 
are dropped when there are buffers available in shared memory 1 08. The 

15 disadvantage is that a buffer may not be available in the r eserved pool for a non- 
congested port even if the respective port r eserved pool 202 is below its reserved 
threshold 

The shar ed free counter 400 stores the number of available or flee buffers in 
the shar ed pool 200 , Buffers in the shared pool 200 of shared memory 108 may 

20 stor e a data packet for any egr ess port 112^.. The shar ed See counter 400 is 

decremented each time a buffer is allocated from the shared pool 200 , Shared pool 
count 430 indicates whether there are fiee buffers in the shared pool 200.. The 
shar ed pool count 430 is forwarded to pool counter logic 302,. 

The Multicast fiee counter' 416 stores the number of available buffers in the 

25 Multicast pool 204 , The port reserve pool counter s 414 0 _ n store the number of 
buffers allocated to the respective egress port 1 12 from the respective port reserve 
pool 202 0 -202 n and the shared pool 200 A queue reserved threshold register 412 
stores the number of buffers reserved for allocation to a port reserve pool 202 0 - 
202 a . One queue reserved threshold register- 412 is provided if the same number of 

30 buffers is r eserved for- allocation to each port reserve pool 202 0 - 202 n , In an 
alternative embodiment each port reserve pool 202 0 - 202 n may have a different 
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number of buffers reserved for allocation and thus a queue reserved threshold 
register 412 is required for each, poit reserve pool counter' 4 14 ft .„. 

The reserved fiee countei 428 is preset to the sum of the buffers reserved foi 
each egress port 1 12 or is preset to a number less than the sum. If the reserved free 
5 counter is preset to a number less than the sum, more reserved memory is provided 
per egress port but an egress port may not be able to allocate all of the buffers in its 
respective r eserved pooh For example, if ther e ar e four' reserve port pools each with 
100 buffers reserved and the total reserved is set to 400.. Upon receiving a data 
packet to forward to an egress port, the packet is dropped if there are no free buffers 
1 0 in the shar ed pool and the egress port has no fiee buffers in its r espective reserve 
port pool even if there are fiee buffers in the reserved pool . If each reserve port pool 
is oversubscribed to 150, each egress port may use up to 150 buffers of the 400 
buffers in the reserved pool Thus, more buffers are available for a congested port , 
However, by oversubscribing, an egress port is not guaranteed allocation of its 
15 reserved pool of buffers, For example, in a switch with 4 ports and each reserve port 
pool oversubscribed to 150, with one congested port using 150 buffers, there are 250 
buffers available for allocation to the other 3 ports , 

Each of the counters 400, 402, 414^ 416, and 428 has a respective counter 
increment-decrement signals 422, 404, 426^, 410 and 436 generated by the pool 
20 counter logic 302 and forwarded on counter' increment-decrement 306 (Fig.. 3). A 
counter' 400, 402, 414 0 . n> 416, and 428 is incr emented or decremented each time a 
buffer is allocated or freed fiom a respective pool. For' example, the shared fiee 
counter 400 is decremented each time a buffer is allocated from the shar ed pool 200 
and incremented each time a buffer is fieed back to the shared pool 200. The total 
25 fiee counter 402 is decremented each time a buffer is allocated from any pool and 
incr emented each time a buffer is fieed to any pool 

Fig. 5 is a flowchart illustrating the steps for' selecting a pool 200, 202 0 -202 n 
204 in shared memory 108 fiom which to allocate a buffer implemented in pool 
counter logic 302 shown in Fig. 3 Fig 5 is described in conjunction with Fig. 3 and 
30 Fig, 4. 
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At step 500, the pool counter logic 302 waits for notification of a received 
data packet to be stored in shared memoiy 108 , The notification of a received data 
packet includes a foiward vector 1 14 and a multicast tag 126 foiwaided from the 
ingress poits engine 104. The foiward vector 114 indicates the egress poits 112 0 . n to 
5 which the received data packet is to be foiwaided . The Multicast tag 126 indicates 
whether the data packet is an IP Multicast data packet. If a data packet is received, 
processing continues with step 502., If 'not, processing continues with step 500, 

At step 502, the pool counter logic 302 determines if there is a buffer 
available in the shared memoiy 108 for stating the r eceived data packet., A buffer is 
10 available in shared memoiy 1 08 if total free 408 is greater than zero, Total free 408 
is the count of free buffers stored in the total free counter 402 (Fig. 4).. If a buffer is 
available, processing continues with step 504 . If not, proqessing continues with step 
518., 

At step 504, Hie pool counter logic 302 determines if the received data packet 
15 is an IP Multicast data packet, The state of the Multicast tag 126 indicates whether 
the data packet is an IP Multicast data packet If the data packet is an JP Multicast 
data packet, the data packet is stor ed once in shar ed memory 108 and a copy of the 
stored data packet is foiwaided to each member of the IP Multicast gr oup.. Hie data 
packet may be foiwaided multiple times to one or more egress poits 1 12 0 . n ,. If the 
20 received data packet is an IP Multicast data packet, processing continues with step 
520. If not, processing continues with step 505, 

At step 505, the received data packet is a non-IP Multicast data packet., 
Thus, a buffet must be allocated from the shared pool 200 oi the poit resei ve pool 
202 0 - 202 n ., The pool counter logic 302 determines if there is a buffer available in 
25 any of the port reserve pools 202 0 - 202 n .. A buffer is available if the reserved pool 
count 432 is greater than zero., The reserved pool count 432 is checked because the 
r eserved pool 202 may be over subscribed., Thus, even if the queue count 434 0 ^ 
associated with the egr ess poit 1 12 0 ^ to which the r eceived data packet is to be 
foiwaided is less than the reseived thr eshold 418, there may not be a buffer- available 
30 in the r eserved pools 202 , If there is a buffer available, processing continues with 
step 506. If not, processing continues with step 508 . 
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At step 506, The pool counter logic 302 deteimiaes if ther e is a buffer 
available in fhe respective port reserve pool 202 0 - 202 n ,. A buffer is available if the 
queue count 434 0 _ n associated with the egress pott 1 12 0 _ n to which the received data 
packet is to be forwarded is less than the reserved threshold 418, If so, processing 
5 continues with step 512,, If not, processing continues with step 508 . 

At step 508, no buffers ar e available in the respective port reserve pool 202 0 - 
202 n of the shared memory 108 , The pool counter' logic 302 determines if there is a 
buffer available in the shared pool 200. A buffer' is available if the shared pool count 
430 is greater than zero.. If so, processing continues with step 510 . If not, processing 
1 0 continues with step 518.. 

At step 510, a buffer' is allocated from the shared pool 200., The pool counter 
logic 302 updates the number of buffers available in the shared pool 200 by 
decrementing the shared free counter 400 through increment-decrement shared 422 . 
Pr ocessing continues with step 514, 
15 At step 512, the pool counter logic 302 updates the number of buffers 

available in the port reserve pool 202 by decrementing the reserved free counter 428 
through increment-decrement reserved 436., Processing continues with step 514 

At step 514, the pool counter logic 302 updates the number of buffers used by 
the egr ess port by incr ementing the port r eserve pool counter 414 through increment- 
20 decrement queue 426, The respective port reserve pool counter 4 14 is selected 
dependent on the egress port 112 selected in the forward vector 1 14 . A buffer for' a 
r espective egr ess port may be allocated fiom the shar ed pool 200 or the r espective 
port reserve pool 202 . Thus : the respective reserved queue count 434 may be gr eater 
than the reserved threshold 418, Processing continues with step 516 
25 At step 516, the pool counter logic 302 updates the total number of fitee 

buffer s in shar ed memory 108 by decr ementing the total Jxee counter 402 thr ough 
increment-decrement total 404. Processing continues with step 526 

At step 518, there is no buffer available in shar ed memory 108 , The egress 
port 1 12 has used all the buffers from the respective port reserve pool 202 and all 
,30 buffers in the shared pool 200 are in use The received data packet is dropped. 
Processing is complete. 
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At step 520, the received data packet is an IP Multicast data packet Thus, a 
buffet must be allocated from the Multicast pool 204. The pool counter logic 302 
determines if there is a buffer available in the Multicast pool 204 . A buffer is 
available if the multicast pool count 420 is greater than zero If so, pr ocessing 
5 continues with step 522., If not, processing continues with step 518 . 

At step 522, the pool counter logic .302 updates the number of buffers 
available in the multicast pool 204 by decrementing the multicast free counter 416 
through increment-decrement multicast 410 . Processing continues with step 524,, 

At step 524, the pool counter logic 302 sets multicast tag 310 to indicate that 
10 the buffei is to be allocated for an 3P Multicast data packet. The multicast tag 310 is 
stored with the pointer to the allocated buffer in shared memory 108 stor ed in an 
egress port queue (not shown) associated with each egress port 1 12 to which the IP 
Multicast data packet is to be forwarded Processing continues with step 526 

At step 526, a buffer 1 is available for allocation, The pool counter logic 302 
15 indicates that a buffer is available for storing the received data packet thr ough 
buffet_ available 308. Processing is complete, 

Fig. 6 is a flowchart illustrating the steps for r eturning a buffer to a pool 
implemented in the pool counter' logic 302 shown in Fig , 3 

At step 600, the pool counter logic 302 waits for 1 notification of a buffei to be 
20 returned to shared memory 108 from the egress ports engine 110., The egress ports 
engine 110 schedules the forwarding of a stored data packet to an egress port 112 
through select 120 forwarded to the packet storage manager- 1 06,. The egr ess potts 
engine 110 includes an egress port queue (not shown) for each egress port 1 12 , The 
egr ess port queue stores pointer's to allocated buffers in shared memory 108, After 
25 the egress ports engine 1 10 has forwarded the data packet, the buffer 1 can be freed. If 
the buffer- is to be freed, processing continues with step 602 . If not, processing 
continues with step 602. 

At step 602, the multicast tag 126 stored with the pointer of the buffer' to be 
fr eed indicates whether the buffer' was allocated from the multicast pool 204 , If the 
30 buffer was allocated from the multicast pool, processing continues with step 605 If ' 
not, processing continues with step 604,. 



NSDOCID: <WO 0209364A2_I_> 



WO 02/09364 



PCT/CAO 1/010 49 



-13- 

At step 604, the pool counter logic 302 determines if the buffer is to be 
returned to the shared pool 200 or' to a port reserve pool 202 . The pool counter logic 
302 examines the reserved queue count 434 associated with the egress port 1 12 to 
which the data packet was forwarded. If the respective reserved queue count 434 is 
5 greater than the reset ved threshold 4 1 8 3 the buffer is to be returned to the shared pool 
200 and processing continues with step 610.. If the respective reserved queue count is 
less than the reserved threshold 41 8, the buffer is to be returned to the respective port 
reserve pool 202 and pr ocessing continues with step 60S 

At step 605, the pool counter logic 302 determines if a copy of the IP 
1 0 Multicast packet has been forwarded to ail members of the IP Multicast group., The 
TP Multicast data packet r eceived at the ingress port 102 (Fig., 1) is stor ed in shar ed 
memory 108 (Fig 1) until the IP Multicast data packet has been forwarded to all 
members of the IP Multicast group After' the last IP Multicast data packet has been 
forwarded, the buffer in shared memory 106 storing the IP Multicast data packet can 
15 be returned to the IP Multicast pool 204 . 

To determine if a copy of the IP Multicast data packet has been forwarded to 
all members of the IP Multicast group, the number 1 of port queues (not shown) that 
the IP Multicast data packet has been queued on is stored in a port queue counter 
register (not shown). The counter register is decremented each time a copy of the IP 
20 Multicast data packet is forwarded to the egress port 1 10 (Big, 1) , The number of 
port queues is determined from the forward vector 114, 

If a copy of the IP Multicast data packet has been forwarded to all members 
of the IP Multicast group, pr ocessing continues with step 606 . If not, pr ocessing 
continues with step 600,. 
25 At step 606, the pool counter logic 302 updates the number of buffers 

available in the multicast pool 204 by incrementing the multicast fr ee counter 416 
through increment-decr ement multicast 41 0 , Processing continues with step 614 . 

At step 608, the pool counter 1 logic 302 updates the number of buffers used in 
the port reserve pools 202 by incrementing the reserved free counter 428 through 
30 increment-decrement reserved 436, Processing continues with step 612. 
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At step 610, the pool counter logic 302 updates the number of buffers 
available in the shared pool 200 by incrementing the shared free counter 400 through 
increment- decrement shared 422 Processing continues with step 612 . 

At step 612, the pool counter logic ,302 updates the number of buffers used by 
5 the egress port by decrementing the respective poit reserve pool counter 414 through 
increment ^decrement queue 426 . The respective port reserve pool counter 414 is 
selected dependent on the state of select forwarded from the egress poits engine 110.. 
Pr ocessing continues with step 614. 

At step 614, the pool counter logic 302 updates the total number of buffers 
10 available in shared memory 1 08 by incrementing the total fiee counter 402 through 
incr ement- decrement total 404 , Processing continues with step 616 . 

At step 616, the buffer is freed by adding a pointer to the buffer to a fiee list. 
Processing is complete . 

While this invention has been particularly shown and described with 
1 5 references to preferred embodiments thereof, it will be understood by those skilled in 
the ait that various changes in form and details may be made therein without 
departing from the scope of the invention encompassed by the appended claims., 



NSDOCID: <WQ 0209364A2 t > 



WO 02/09364 



PCT/CA01/01049 



-as- 



CLAIMS 



What is claimed is: 

1 A switch comprising : 

a reserved pool of buffers in a shared memoiy, the reseived pool of 
5 buffers reserved foi an egress port; 

a shared pool of "buffeis in the shared memory, the shared pool of 
buffers shar ed by a plur ality of egress ports; and 

a pool select logic which selects a free buffer to allocate from the 
reserved pool for storing data received from an ingress port to be forwarded 
10 to the egr ess port, and deallocates the selected buffer after the data has been 

forwarded to the egress port. 

2 A switch as claimed in Claim 1 wherein the pool select logic selects a See 
buffer in the shared pool upon detecting no free buffer in the reseived pool 

3. A switch as claimed in Claim 1 further comprising: 
I 5 a multicast pool of buffers in a shared memory shared by a plurality of 

egr ess ports,. 

4 A switch as claimed in Claim 3 wherein the pool select logic selects a fiee 
buffer- from the multicast pool upon detecting an IP Multicast data packet 
r eceived from an ingress port. 

20 5, A switch as claimed in Claim 3 further comprising: 

a total fiee countei storing the number of free buffers in shared 
memory, the total fiee counter preset to the total number of "buffers in the 
shar ed memory wher ein the sum of the buffei s in the multicast pool, the 
reserved pool and the shared pool is greater than the total number of buffers 

25 in the shared memoiy . 
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6 A switch as claimed in Claim 3 further comprising: 

a total free counter stoiing the number of 'free buffers in shared 
memory, the total free counter preset to the total number 1 of 'buffer's in the 
shared memory wherein the sum of 'the buffers in the multicast pool, the 
5 reserved pool and the shared pool is equal to the total number of buffers in 

the shared memory, 

7 . A switch as claimed in Claim 5 wherein the pool select logic determines the 
availability of a free buffer in the shar ed memory dependent on the number of 
flee buffers in the shared memory stored in the total See counter 

10 8 , A switch as claimed in Claim 6 wherein the pool select logic determines the 
availability of a free buffer in the shared memory dependent on the number of 
See buffers in the shared memory stored in the total free counter. 

9. A switch comprising: 

a reserved pool of buffers in a shared memory, the reserved pool of 
1 5 buffers reserved for an egress port; 

a shar ed pool of buffers in the shar ed niemory, the shared pool of 
buffers shared by a plur ality of egress ports; 

a means for selecting a free buffer 1 to allocate from the reserved pool 
for 1 storing data received from an ingress port to be forwarded to the egress 
20 port; and 

a means for deallocating the selected buffer after' the data has been 
forwarded, 

10. A switch as claimed in Claim 9 wher ein the means for selecting selects a free 
buffer in the shar ed pool upon detecting no free buffer' in the reserved pool , 

25 11,. A switch as claimed in Claim 9 further comprising: 
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a multicast pool of buffers in a shared memoiy shared by a plurality of 
egr ess potts,. 

12 . A switch as claimed in Claim 1 1 wherein the means for selecting selects a 
free buffer from the multicast pool upon detecting an IP Multicast data packet 

5 received from an ingress port. 

13 . A switch as claimed in Claim 12 further comprising: 

means for counting the number of flee buffers in shared memory, the 
means for counting preset to the total number of "buffers in the shared memoiy 
wherein the sum of the buffers in the multicast pool, the reserved pool and the 
1 0 shar ed pool is gr eater than the total number of buffer s in the shared memory . 

14 A switch as claimed in Claim 12 further comprising: 

means for counting the number of flee buffer s in shared memory, the 
means for counting preset to the total number' of "buffers in the shared memory 
wherein the sum of the buffers in the multicast pool, the r eserved pool and the 
1 5 shared pool is equal to the total number' of buffers in the shared memory. 

15,. A switch as claimed in Claim 13 wherein the means for' selecting a fiee buffer' 
determines the availability of a fi ee buffer in the shared memory dependent 
on the number' of fi ee buffer s in the shared memory stor ed in the means for 
counting, 

20 16 A switch as claimed in Claim 14 wherein lire means for selecting a free buffer 
determines the availability of a fiee buffer' in the shar ed memory dependent 
on the number of free buffers in the shared memoiy stored in the means for' 
counting. 

17. A method for managing a shared memoiy in a switch comprising the steps of: 
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providing a reserved pool of buffers in the shared memory, the 
reserved pool of buffers reserved for an egress pott; 

providing a shared pool of buffers in the shared memory, the shared 
pool of buffers shared by a plurality of egress port; 
5 selecting a flee buffer' to allocate from the reserved pool for' storing 

data received from an ingress port to be forwarded to the egress port; and 

deallocating the selected buffer' after the data has been forwarded to 
the egr ess port . 

18 , A method as claimed in Claim 17 wherein the step of selecting selects a free 
1 0 buffer in the shared pool upon detecting no free buffer in the reserved pool 

1 9 A method as claimed in Claim 1 7 further comprising the step of: 

providing a multicast pool of buffeis in a shared memory shared by a 
plurality of egress ports, 

20 A method as c laimed in Claim 19 wher ein the step of selecting selects a fr ee 
1 5 buffer from the multicast pool upon detecting an IP Multicast data packet 

received from an ingress port. 

2 L A switch as claimed in Claim 20 further comprising: 

providing a total free counter for counting the number of free buffer's 
in shared memory, the total flee counter preset to the total number of buffers 
20 in the shared memory wherein the sum of the buffers in the multicast pool, 

the reserved pool and the shared pool is greater' than the total number of 
buffers in the shared memory. 



22 A switch as claimed in Claim 20 further comprising: 

providing a total free counter' for counting the number of free buffers 
25 in shared memory, the total fl ee counter preset to the total number of buffers 
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in the shaied memoiy wherein the sum of the buffers in the multicast pool, 
the reseived pool and the shaied pool is equal to the total number of 'buffers 
in the shaied memoiy. 

23 . A switch as claimed in Claim 21 fuither comprising the step of: 

deteimining the availability of a See buffer in the shared memoiy 
dependent on the number of "free buffers in the shared memoiy stored in the 
total See counter,. 

A switch as claimed in Claim 22 fuither comprising the step of: 

deteimining the availability of a fiee buffer in the shaied memory 
dependent on the number of fiee buffers in the shared memoiy stored in the 
total fiee counter, 



24., 

10 
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